Glycosyl hydrolase xylanases, compositions and methods of use for efficient hydrolysis and processing of xylan

ABSTRACT

The invention provides a unique subset of GH30 subfamily 8 xylanases (GH30-8) with endo-β-1,4-xylanase activity, compositions comprising an effective amount of the GH30-8 xylanases, methods of synthesis and methods of use thereof.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 62/023,116, filed Jul. 10, 2014, which is incorporated by reference in its entirety for all purposes.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention is owned by and was made with government support from the USDA Forest Service. The government has certain rights in the invention.

BACKGROUND OF THE INVENTION

Xylanases (endo-β-1,4-xylanase, EC 3.2.1.8) hydrolyze internal β-1,4-xylosidic linkages in xylan to produce smaller molecular weight xylose and xylo-oligomers. Xylans are polysaccharides formed from β-1,4-glycoside-linked D-xylopyranoses. Xylanases are very useful in multiple commercial applications, including for dough preparation or bread product preparation fruit and vegetable processing, breaking down agricultural waste, manufacturing animal feed as well as in lignocellulose pretreatment and pulp and paper production. Cellulose and hemicellulose materials which can be converted into fermentable sugars are considered a very useful and under-utilized source of renewable biomass materials. Individual β-1,4-glucose chains once synthesized, self-associate through hydrogen bonding to form semi-crystalline cellulose microfibrils (cellulose). The cellulose microfibrils are embedded in a polysaccharide matrix formed of hemicelluloses such as xylan, galactoglucomannan and xyloglucan all of which may be associated with other lower abundance biomass polysaccharides such as arabinans, mannans; pectins including galacturonans and galactans and various other β-1,3 and β-1,4 glucans all of which are dependent upon the plant source and the cellulosic tissue in question. The hemicellulose matrix is also typically surrounded and cross linked with polyphenolic lignins. From the tight interactions that exist between cellulose, hemicellulose and lignin, it is very difficult and expensive to break down this recalcitrant matrix of biomass to yield desired mixtures of oligosaccharides or fermentable simple sugars.

The primary hemicellulose from hardwood and crop residues is a glucuronoxylan (GX) consisting of a chain of β-1,4-linked xylose residues randomly substituted with α-1,2-linked glucuronic acid (GA) residues. Frequency of substitution has been shown to be as high as 1 GA for every 6 xyloses. In hardwoods, unaltered GX is additionally acetylated to a high degree on the O-2 or O-3 (or O-2 and O-3) hydroxyl positions. Commercial extraction of these polysaccharides typically is done under alkaline conditions which deacetylates these polysaccharides. This is the form of GX commonly used for laboratory studies. Other lower yielding extraction procedures must be implemented to obtain a glucuronoacetylxylan polysaccharide. Xylan is the second most abundant hemicellulose in softwood species next to galactoglucomannan, accounting for just less than half of the total hemicellulose. This source of xylan is in the form of glucuronoarabinoxylan with periodic GA substitutions and α-1,2-linked arabinofuranose substitution on the O-2 or O-3 (or O-2 and O-3) hydroxyl positions of xylose. Xylans from other sources such as grains (wheat, WAX) typically consist of an arabinoxylan, having patterns of arabinofuranose substitution similar to softwoods, but lacking the GA substitution.

The glycosyl hydrolase (GH) family 30 (GH30) enzymes (Cantarel, et al., 2009) have recently been redefined (St. John, et al., 2010). The new family composition consists of eight subfamilies that can be assigned into two structurally and phylogenetically distinguishable groups. Biochemical and structural studies have shown the enzymes in subfamily 8 to have unique characteristics in the degradation of the hemicellulosic polymer glucuronoxylan (St. John, et al., 2006; Vrsanska, et al., 2007; Hurlbert & Preston, 2001).

Accordingly, a need exists for novel glycosyl hydrolase enzymes, compositions and methods of use which can more efficiently convert plant or other cellulosic or hemicellulosic materials into fermentable sugars.

SUMMARY OF THE INVENTION

The present invention provides a functionally unique subset of GH30 subfamily 8 xylanases (GH30-8) with GA-independent endo-β-1,4-xylanase activity, compositions comprising an effective amount of the GH30-8 xylanases of the present invention, methods of synthesis and methods of use thereof.

In a first aspect, the invention encompasses an isolated GA-independent GH30-8 enzyme or variant thereof exhibiting xylanase activity. The enzyme or variant includes the amino acid sequence (W or Y)(W or F)W(I or V or F)(not R)(not R) (SEQ ID NOs:69-80) within the β8-α8 loop of the enzyme or variant.

In an alternate aspect, the invention comprises an isolated GA-independent GH30-8 enzyme or variant thereof exhibiting xylanase activity comprising at least one of SEQ ID NOs: 1-4 in the β7-α7 and β8-α8 loops, wherein the amino acid sequence (W or Y)(W or F)W(I or V or F)(not R)(not R) (SEQ ID NOs:69-80) is within the β8-α8 loop of the enzyme or variant.

In an alternate aspect, the invention comprises an isolated GA-independent GH30-8 enzyme exhibiting xylanase activity or variant comprising an amino acid sequence that is at least 30% identical to an amino acid sequence selected from the group consisting of G7M3Z8, MINOD3, Q97TI2, F7ZYN8, FOKEL6, COIQA1, COIQA2, B3TJG3, E4T705, H1YFT8, and F1TBY8.

In some embodiments, GA-independent GH30-8 enzyme or variant of the invention includes an amino acid sequence that is at least 30% identical to SEQ ID NO:1 residues 33-420 (Q97TI2), SEQ ID NO:2 residues 32-421 (F1TBY8), SEQ ID NO:3 beginning at residue 45 (E4T705), SEQ ID NO:4 beginning at residue 33 (H1YFT8), SEQ ID NO:32 (C0IQA1), SEQ ID NO:33 (B3TJG3), SEQ ID NO:34 (G7M3Z8), SEQ ID NO:35 (M1N0D3), SEQ ID NO:36 (F7ZYN8), or SEQ ID NO:37 (F0KEL6). In some such embodiments, the GA-independent GH30-8 enzyme or variant includes an amino acid sequence that is at least 80% identical to the selected amino acid sequence. In some such embodiments, the GA-independent GH30-8 enzyme or variant includes an amino acid sequence that is at least 95% identical to the selected amino acid sequence.

In some embodiments, the GA-independent GH30-8 enzyme or variant includes the amino acid sequence of SEQ ID NO:1 residues 33-420 (Q97TI2), SEQ ID NO:2 residues 32-421 (F1TBY8), SEQ ID NO:3 beginning at residue 45 (E4T705), SEQ ID NO:4 beginning at residue 33 (H1YFT8), SEQ ID NO:32 (COIQA1), SEQ ID NO:33 (B3TJG3), SEQ ID NO:34 (G7M3Z8), SEQ ID NO:35 (M1N0D3), SEQ ID NO:36 (F7ZYN8), SEQ ID NO:37 (F0KEL6), or the amino acid sequence of COIQA2.

In some embodiments, the GA-independent GH30-8 enzyme is XynQ97.

In some embodiments, GA-independent GH30-8 enzyme is XynC71 (aka CpXyn30A).

In a second aspect, the invention encompasses a GH30-8 enzyme composition. The composition includes a first polypeptide having xylanase activity that includes the amino acid sequence (W or Y)(W or F)W(I or V or F)(not R)(not R) (SEQ ID NOs:69-80), and a second polypeptide different from the first having xylanase activity that also includes the amino acid sequence (W or Y)(W or F)W(I or V or F)(not R)(not R) (SEQ ID NOs:69-80). The GH30-8 enzyme composition is capable of hydrolyzing a lignocellulosic biomass material.

In some embodiments, the composition further includes at least one additional protein having enzymatic activity.

In some embodiments, the first polypeptide, the second polypeptide, or both include an amino acid sequence that at least 30% identical to SEQ ID NO:1 residues 3.3-420 (Q97TI2), SEQ ID NO:2 residues 32-421 (F1TBY8), SEQ ID NO:3 beginning at residue 45 (E4T705), SEQ ID NO:4 beginning at residue 33 (H1YFT8), SEQ ID NO:32 (C0IQA1), SEQ ID NO:33 (B3TJG3), SEQ ID NO:34 (G7M3Z8), SEQ ID NO:35 (M1N0D3), SEQ ID NO:36 (F7ZYN8), or SEQ ID NO:37 (F0KEL6). In some such embodiments, the first polypeptide, the second polypeptide, or both include an amino acid sequence that is at least 80% identical to the selected amino acid sequence. In some such embodiments, the first polypeptide, the second polypeptide, or both include an amino acid sequence that is at least 95% identical to the selected amino acid sequence.

In some embodiments, the first polypeptide, the second polypeptide, or both include an amino acid sequence selected from SEQ ID NO:1 residues 33-420 (Q97TI2), SEQ ID NO:2 residues 32-421 (F1TBY8), SEQ ID NO:3 beginning at residue 45 (E4T705), SEQ ID NO:4 beginning at residue 33 (H1YFT8), SEQ ID NO:32 (COIQA1), SEQ ID NO:33 (B3TJG3), SEQ ID NO:34 (G7M3Z8), SEQ ID NO:35 (M1N0D3), SEQ ID NO:36 (F7ZYN8), SEQ ID NO:37 (F0KEL6), or the amino acid sequence of COIQA2.

In some embodiments, the amount of polypeptides having xylanase activity relative to the total amount of proteins in the enzyme composition is about 10 wt. % to about 20 wt. %.

In a third aspect, the disclosure encompasses a method of hydrolyzing or digesting a lignocellulosic biomass material comprising hemicelluloses, cellulose, or both. The method includes the steps of contacting the GH30-8 enzyme composition described above with the lignocellulosic biomass mixture.

In some embodiments, the lignocellulosic biomass mixture comprises an agricultural crop, a byproduct of a food/feed production, a lignocellulosic waste product, a plant residue, or waste paper.

In some embodiments, the biomass material in the lignocellulosic biomass mixture is subjected to pretreatment, wherein the pretreatment is an acidic pretreatment or a basic pretreatment. In some such embodiments, the pretreatment includes a thermal, aqueous or thermomechanical pulping.

In some embodiments, the GH30-8 enzyme composition is used in an amount and under conditions and for a duration sufficient to convert at least 60% to 90% of the xylan in the biomass material into xylooligosaccharides.

Other objects, features and advantages of the present invention will become apparent after review of the specification, claims and drawings.

INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, and patent application was specifically and individually indicated to be incorporated by reference.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be better understood and features, aspects and advantages other than those set forth above will become apparent when consideration is given to the following detailed description thereof.

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.

FIG. 1. Exemplified glucuronoxylan processing by canonical GA-dependent GH30-8 xylanases.

FIG. 2. Molecular contacts between aldotriuronate and the active site/substrate binding cleft of the canonical GH30-8 glucuronoxylanhydrolase, XynC from Bacillus subtilis 168.

FIG. 3. A close-up of the glucoronate appendage with the typically conserved β7-α7 and β8-α8 loop regions of XynC of Bacillus subtilis showing the contacts which allow specificity for this appendage in the canonical GH30-8 xylanases.

FIG. 4. An amino acid alignment of diverse GA-dependent GH30-8 xylanase amino acid sequences with similarity to XynC from Bacillus subtilis and XynA from Erwinia chrysanthemi and showing the canonical conserved regions between these enzymes which includes the β7-α7 and β8-α8 loop regions. The sequence labeled Q45070 is SEQ ID NO:5; the sequence labeled V3T509 is SEQ ID NO:6; the sequence labeled S9R6D6 is SEQ ID NO:7; the sequence labeled Q4URA6 is SEQ ID NO:8; the sequence labeled B3PEH7 is SEQ ID NO:9; the sequence labeled U4MR52 is SEQ ID NO:10; the sequence labeled M1MG04 is SEQ ID NO: 11; the sequence labeled R7H6D7 is SEQ ID NO:12; the sequence labeled W4C6X9 is SEQ ID NO:13; the sequence labeled D7C0B1 is SEQ ID NO:14; the sequence labeled M1MLA3 is SEQ ID NO:15; the sequence labeled D3EH02 is SEQ ID NO:16; the sequence labeled U4QXC0 is SEQ ID NO:17; the sequence labeled IOL743 is SEQ ID NO:18; the sequence labeled K2P3H7 is SEQ ID NO:19; the sequence labeled E3E322 is SEQ ID NO:20; the sequence labeled M5RGV9 is SEQ ID NO:21; the sequence labeled A8FDV2 is SEQ ID NO:22; the sequence labeled B4ADC5 is SEQ ID NO:23; the sequence labeled R9TYN3 is SEQ ID NO:24; the sequence labeled M1XAU4 is SEQ ID NO:25; the sequence labeled C9RS22 is SEQ ID NO:26; the sequence labeled F4FBT9 is SEQ ID NO:27; the sequence labeled E8SBF5 is SEQ ID NO:28; the sequence labeled R6LF43 is SEQ ID NO:29; the sequence labeled H1YFT7 is SEQ ID NO:30; the sequence labeled Q46961 is SEQ ID NO:31.

FIG. 5. An amino acid sequence alignment of the GA-independent GH30-8 subset of enzymes being disclosed including the canonical GA-dependent enzymes XynC from Bacillus subtilis and XynA from Erwinia chrysanthemi for comparison revealing how these enzymes show high levels of sequence identity not different than that shown in FIG. 4, yet the β7-α7 and β8-α8 loop regions are conserved. The sequence labeled Q45070 in FIG. 5 is residues 2-389 of SEQ ID NO:5, which ends with amino acids of asparagine (Asn or N) and arginine (Arg or R); the sequence labeled COIQA1 is SEQ ID NO:32; the sequence labeled B3TJG3 is SEQ ID NO:33; the sequence labeled E4T705 is SEQ ID NO:3, beginning at residue 45; the sequence labeled H1YFT8 is SEQ ID NO:4, beginning at residue 33; the sequence labeled G7M3Z8 is SEQ ID NO:34; the sequence labeled M1N0D3 is SEQ ID NO:35; the sequence labeled Q97TI2 is SEQ ID NO:1, residues 33 to 420; the sequence labeled F7ZYN8 is SEQ ID NO:36; the sequence labeled F0KEL6 is SEQ ID NO:37; the sequence labeled F1TBY8 is SEQ ID NO:2, residues 34 to 421; the sequence labeled Q46961 is SEQ ID NO:31, which ends with a lysine (Lys or K).

FIG. 6. Limit hydrolysis product analysis by thin layer chromatography, comparing the hydrolysis products of the GA-independent CpXyn30A (C7I) and CaXyn30A (Q97) enzymes to the canonical GA-dependent GH30-8 xylanase XynC from Bacillus subtilis and a GH10 family xylanase.

FIG. 7A. Small scale sequence alignment highlighting the β7-α7 and β8-α8 region of the GA-dependent GH30-8 enzymes with the GA-independent CpXyn30A (C7I, Cp_F1TBY8) included for comparison. The sequence labeled Bs_Q45070 is SEQ ID NO:5, residues 223-295; the sequence labeled Cs_M1MLA3 is SEQ ID NO:15, residues 223-295; the sequence labeled Cp_F1TBY8 is SEQ ID NO:2, residues 258 to 324; The sequence labeled Ra_E6UFE8 is SEQ ID NO:38; the sequence labeled Cj_B3PEH7 is SEQ ID NO:9, residues 220 to 292; the sequence labeled Xp_F0BZ40 is SEQ ID NO:39; the sequence labeled Ec_Q46961 is SEQ ID NO:31, residues 219 to 288.

FIG. 7B. The native DNA coding sequence of CpXyn30A encodes a protein of 628 amino acids including a N-terminus secretion signal sequence, a C-terminal family 6 carbohydrate binding module and two C-terminal dockerin domains for interaction of the secreted, mature form of CpXyn30A with a cellulosome complex.

FIG. 7C. In support of the qualitative sequence alignment in FIG. 7A, a phylogenetic tree showing that CpXyn30A (Cp_F1TBY8) with its unique sequence in the β7-α7 and β8-α8 loop regions still groups confidently with the canonical GH30-8 enzymes.

FIG. 8A. The crystal structure of CpXyn30A superposed on XynC of Bacillus subtilis establishing a structural relationship.

FIG. 8B. CpXyn30A superposed with XynC from Bacillus subtilis to compare structural differences in the α2 and α3 helix region.

FIG. 8C. CpXyn30A superposed with XynC from Bacillus subtilis to compare the β3-α3 loop region which indicate that this notable structural portion is more similar to GH30-8 enzymes which derive from Gram-positive bacteria.

FIG. 9A. Ligand bound structural model of XynC from Bacillus subtilis superposed with the structure model of CpXyn30A showing how the substrate binding specificity is altered by the changes to the β7-α7 and β8-α8 loops.

FIG. 9B. Superposition of CpXyn30A with XynC from Bacillus subtilis showing the amino acid side chains which are different resulting in a loss of specificity.

FIG. 10A. Thin layer chromatography analysis of hydrolysis products of glucuronoxylan (SGX) and arabinoxylan (WAX) by CpXyn30A compared with XynC from Bacillus subtilis.

FIG. 10B. Thin layer chromatography analysis of hydrolysis products of xylooligosaccharides by CpXyn30A compared to XynC from Bacillus subtilis.

FIG. 11. HPLC analysis of CpXyn30A hydrolysis of xylohexaose showing transglycosylation activity and a flow chart for how this occurs.

FIG. 12. β7-α7 loop regions of the disclosed GH30-8 subset of enzymes being compared to XynC from Bacillus subtilis (top, Q45070_Bsubtili) and XynA from Erwinia chrysanthemi (bottom, Q46961_Dchrysan). The sequence labeled Q45070 is SEQ ID NO:5, residues 222 to 241; the sequence labeled COIQA1 is SEQ ID NO:32, residues 218 to 235; the sequence labeled B3TJG3 is SEQ ID NO:33, residues 214 to 231; The sequence labeled E4T705 is SEQ ID NO:3, residues 255 to 275; the sequence labeled H1YFT8 is SEQ ID NO:4, residues 234 to 252; the sequence labeled G7M3Z8 is SEQ ID NO:34, residues 219 to 236; the sequence labeled M1N0D3 is SEQ ID NO:35, residues 219 to 236; the sequence labeled Q97TI2 is SEQ ID NO:1, residues 253 to 270; the sequence labeled F7ZYN8 is SEQ ID NO:36, residues 221 to 238; the sequence labeled F0KEL6 is SEQ ID NO:37, residues 221 to 238; the sequence labeled F1TBY8 is SEQ ID NO:2, residues 257 to 274; the sequence labeled Q46961 is SEQ ID NO:31, residues 218 to 235.

FIG. 13. β8-α8 loop regions of the disclosed GH30-8 subset of enzymes being compared to XynC from Bacillus subtilis (top, Q45070_Bsubtili) and XynA from Erwinia chrysanthemi (bottom, Q46961_Dchrysan). The sequence labeled Q45070 is SEQ ID NO:5, residues 263 to 278; the sequence labeled COIQA1 is SEQ ID NO:32, residues 253 to 270; the sequence labeled B3TJG3 is SEQ ID NO:33, residues 249 to 268; The sequence labeled E4T705 is SEQ ID NO:3, residues 292 to 311; the sequence labeled H1YFT8 is SEQ ID NO:4, residues 269 to 287; the sequence labeled G7M3Z8 is SEQ ID NO:34, residues 254 to 270; the sequence labeled M1N0D3 is SEQ ID NO:35, residues 254 to 270; the sequence labeled Q97TI2 is SEQ ID NO:1, residues 288 to 304; the sequence labeled F7ZYN8 is SEQ ID NO:36, residues 256 to 272; the sequence labeled F0KEL6 is SEQ ID NO:37, residues 256 to 272; the sequence labeled F1TBY8 is SEQ ID NO:2, residues 292 to 307; the sequence labeled Q46961 is SEQ ID NO:31, residues 256 to 271.

FIG. 14. SEQ ID NO: 1. Amino acid sequence of UniProt accession Q97TI2 having the named protein expression product Q97 (aka XynQ97, CaXynQ97, CaXyn30A, Q97_RCN and XynQ97_RCN).

FIG. 15. SEQ ID NO: 2. Amino acid sequence of UniProt accession F1TBY8 having 100% identity to the obsolete UniProt accession C7IMC9 and having the named protein expression product C7I (aka XynC7I, CpXynC7I and CpXyn30A).

FIG. 16. SEQ ID NO: 3. Amino acid sequence of UniProt accession E4T705 and having the named protein expression product PpXyn30A.

FIG. 17. SEQ ID NO: 4. Amino acid sequence of UniProt accession H1YFT8 and having the named protein expression product MpXyn30A.

FIG. 18. The β2-α2 loop region of CpXyn30A showing a fine difference between the structure of this enzyme and the canonical GH30-8 subfamily of enzymes.

FIG. 19. SDS-PAGE analysis of additional GA-independent GH30-8 xylanases and the full CaXyn30A (Q97_RCN) xylanase containing the natively encoded C-terminal CBM13 module.

FIG. 20. TLC analysis of additional GA-independent GH30-8 xylanases and the full CaXyn30A (Q97_RCN) xylanase containing the natively encoded C-terminal CBM13 module along with the originally studied Q97 catalytic domain.

FIG. 21A. Schematic representing the xylosyl binding subsites of an endo-xylanase active site detailing the locations along the xylan chain where the enzyme may accommodate or require (as in the GA-dependent GH30-8 xylanases) a GA substitution. Example of GX within the active site of a GH10 xylanase showing GA accommodating xylose subsites and therefore a deduced limit product of aldotetrauronate.

FIG. 21B. Example of GX within the active site of a GH30-8 GA independent xylanase showing GA accommodating xylose subsites and therefore a deduced limit product of aldotriuronate.

DETAILED DESCRIPTION OF THE INVENTION I. In General

Before the present materials and methods are described, it is understood that this invention is not limited to the particular methodology, protocols, materials, and reagents described, as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention.

It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural reference unless the context clearly dictates otherwise. As well, the terms “a” (or “an”), “one or more” and “at least one” can be used interchangeably herein. It is also to be noted that the terms “comprising”, “including”, and “having” can be used interchangeably.

Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications and patents specifically mentioned here are incorporated by reference for all purposes including describing and disclosing the chemicals, instruments, statistical analysis and methodologies which are reported in the publications which might be used in connection with the invention. All references cited in this specification are to be taken as indicative of the level of skill in the art. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.

II. The Invention

The present invention provides a functionally unique subset of GH30 subfamily 8 xylanases (GH30-8) with endo-β-1,4-xylanase activity, compositions comprising an effective amount of the GH30-8 xylanases of the present invention, methods of synthesis and methods of use thereof.

GH30-8 Xylanases.

Glycoside hydrolase enzymes defined in family 30 (GH30) have at least 8 subfamilies (GH30-1 through GH30-8). Enzymes classified into the GH30-8 subfamily constitute a well characterized group of endoxylanases which cleave the β-1,4-xylosidic linkage of xylan only upon recognition of the α-1,2-linked 4-O-methylglucuronic acid (glucuronic acid, GA) side chain appendage common to many xylan types and sources (St. John, et al., 2010; St. John, et al., 2006). Cleavage of the xylan chain occurs toward the polymer reducing terminus relative to the target glucuronic acid, such that the GA appendage is positioned penultimate to the new reducing terminus.

Limit hydrolysis of glucuronoxylan by these “GA-dependent” xylanases primarily result in a distribution of aldouronates in which each contains a single GA appendage substituted on the xylose penultimate to the reducing terminal of the resulting aldouronate (FIG. 1). The way in which these enzymes function to perform this specific type of glucuronoxylan chain cleavage has been described through biochemical studies and more recently from a detailed understanding of the protein structure with associated xylan derived ligands (FIGS. 2 & 3) (St. John et al., 2011; Urbanikova et al., 2011).

Before the enzymes of the present invention were identified and pursued, it was strongly established and widely agreed upon that GH30-8 enzymes were restricted to cleaving the β-1,4-linkage of xylan next to a GA substitution (FIGS. 1, 21). Inherent in that understanding is that the mechanism which allowed for that to occur, if it were to not be what it is, then the enzyme may no longer function. Thus, these canonical GA-dependent GH30-8 xylanases, therefore are known to require GA for their function.

The presently disclosed, functionally distinct subset of GH30-8 xylanases and compositions thereof have a relaxed (or expanded range of) substrate specificity which results in a gain of function, because to function, they do not require the O-2 linked GA. These “GA-independent” xylanases are thus able to hydrolyze diverse polymeric xylans, including glucuronoxylans (GX) such as sweetgum wood xylan (SGX) and beech wood xylan (BX), arabinoxylans such as wheat arabinoxylan (WAX) and neutral xylooligosaccharide (e.g. X6) to smaller xylooligosaccharides and substituted xylooligosaccharides. Such classes of compositions cannot be hydrolyzed by the typical GA-dependent GH30-8 xylanases, which require the O-2 linked glucuronic acid (FIG. 6).

Specifically, the GH30-8 GA-independent xylanases of the present invention comprise any protein which, through primary sequence analysis, contains a confidently classified GH30-8 catalytic module with family conserved catalytic amino acids identified, as a part, or whole of a mature amino acid sequence, but having an altered sequence in place of the functionally characterized, GH30-8 subfamily conserved β7-α7 and β8-α8 loops of interest, as found in the XynQ97 (CaXyn30A), XynC7I (CpXyn30A), PpXyn30A and MpXyn30A xylanases and described further herein. In one embodiment, the specific amino acid sequence in the Pβ7-α7 and β8-α8 loops is as shown in SEQ ID NO: 1 (FIG. 5, Label: Q97TI2_Cacetobu & FIG. 14). The invention also includes those confidently classified GH30-8 (as described above) xylanases comprising at least 30% identity, at least 40% identity, at least 50% identity, at least 60% identity, at least 70% identity, at least 80% identity, at least 90% identity, at least 95% identity, or at least 98% identity to the specific amino acid sequence in the β7-α7 and β8-α8 loop regions of one or more of SEQ ID NO:1, SEQ ID NO:2 (C7I), SEQ ID NO:3 (PpXyn30A) or SEQ ID NO:4 (MpXyn30A).

While the GA-independent GH30-8 xylanases are different in the Jβ7-α7 and β8-α8 loop regions relative to the conserved sequence of these loops in the GA-dependent GH30-8 xylanases, these sequences are also notably diverse within this GH30-8 subset of xylanases (FIGS. 5, 12 and 13), highlighting a likelihood that they may all function uniquely. The function of GA-dependent xylanases appears primarily attributable to the conserved β8-α8 loop sequence WW(YF)(IGL)(RK)R(SQYFC)Y(GS) (RR-motif) (as ascertained from the diverse alignment provided in FIG. 4). In the GH30-8 GA-independent xylanases the conserved RR-motif sequence in this loop is replaced with the sequence (WY)(WF)W(IVF)(not R)(not R) (SEQ ID NOs:69-80), as ascertained from the GA-independent GH30-8 alignment provided in FIG. 5). Accordingly, the invention encompasses GA-independent GH30-8 xylanases or variants thereof comprising the β8-α8 loop sequence (WY)(WF)W(IVF)(not R)(not R) (SEQ ID NOs:69-80).

Indeed, the CpXyn30A (C7I) xylanase performed very similar to the CaXyn30A (Q97) with respect to the final hydrolysis products detected by TCL, but the measured rate of hydrolysis was notably low. It is not clear from our current level of analysis whether the CpXyn30A yielded a portion of larger oligosaccharides as observed in the TLC (FIG. 6) for both beech wood xylan and wheat arabinoxylan because of this comparably low activity, or because of the unique sequence differences between the β7-α7 and β8-α8 loop regions of these two GA-independent GH30-8 xylanases. Likewise, the PpXyn30A enzyme with yet a different sequence representing its β7-α7 and β8-α8 loops produced very similar results to both CpXyn30A and CaXyn30A (FIG. 20). However, the MpXyn30A enzyme, while clearly functioning as an endoxylanase yielding larger neutral xylooligosaccharides such as xylotriose and xylotetraose failed to result in any significant hydrolysis (barely detectable) yet is still shown to be GA-independent (FIG. 20).

The GA-independent GH30-8 xylanases of the present invention hydrolyze GX to provide a series of small neutral xylooligosaccharides and aldouronates (FIGS. 6, 10A, 10B and 20), which is very different from the aldouronate ladder produced by the GA-dependent GH30-8 xylanases (FIGS. 6, 10A and 10B). Additionally the GX hydrolysis product profile for the GA-independent GH30-8 xylanases of the present invention are more similar, but unique to GX hydrolysis product profiles generated by the common GH10 and GH11 endo-β-1,4-xylanases. None of the GA-independent GH30-8 xylanases result in xylose as a primary hydrolysis product as observed for some GH10 xylanases. Also, unlike the GH10 and GH11 xylanases which produce, as their primary aldouronate limit product of GX hydrolysis, the tetrameric product aldotetrauronate and aldopentauronate, respectively, the GA-independent GH30-8 enzymes are potentially able to liberate aldotriuronate as the smallest limit product (FIGS. 21A-B).

Since hydrolysis of arabinoxylan by GA-dependent GH30-8 xylanases does not occur (FIG. 6), the hydrolysis product profile obtained with the GA-independent GH30-8 xylanases on this substrate is very unique. Further, these enzymes appear to be very efficient at the liberation of difficult to reduce (hydrolyze) arabinoxylan substrates, relative to GH10 xylanases (FIG. 6). For this substrate, the tested GA-independent GH30-8 xylanases produce unique hydrolysis product profiles.

Rationalization and our results support the likelihood that the GA-independent GH30-8 xylanases are better at liberating small substituted xylooligosaccharides and substituted xylooligosaccharides from highly substituted regions of xylans (FIGS. 21A-B). The current state-of-the-art is that GH10 xylanases produce the smallest hydrolysis products. The primary limit product of these xylanases is aldotetrauronate. This is because, as detailed in FIG. 21A the enzyme can accommodate the O-2 substituted GA moiety only in the +1 and −3 subsites within the substrate binding cleft. In comparison to these GH10 xylanases and, as detailed in FIG. 21B, for GA-dependent GH30-8 xylanases the −2 subsite is responsible for the coordination of a GA substituted xylose. Without this, hydrolysis is known not to occur. In these enzymes, although the GA is specifically “bound” in this position, the general observation is that the GA moiety is extended upward out of the catalytic cleft and therefore not into the enzyme where steric interaction might prevent hydrolysis. Notably, based on limit product studies of GX by these GA-dependent GH30-8 xylanases, the smallest aldouronate that might be produced (depending upon the nature of the specific GX being analyzed) is aldotriuronate. This would indicate that in these xylanases, a GA substitution can also be accommodated on the xylose in the +1 subsite (see FIGS. 21A-B). Based on xylan chain bonding and the reported position within xylan binding enzymes, it appears very likely that the xylose in this subsite would present the O-2 hydroxyl upward out of the substrate binding cleft. A GA substituted in this position would likely therefore be accommodated.

Extending this rationale to the GA-independent GH30-8 xylanases of the present invention, it would than seem possible that, barring changes to the +1 subsite region, both the +1 and −2 subsites may accommodate substituted GA moieties. This is verified with the CaXyn30A (Q97) xylanase-(FIG. 6), as the sugar aldotriuronate is clearly visible following a limit digestion. The extended range of function of these enzymes is also likely to benefit hydrolysis of the more highly substituted (with arbinofuranose) wheat arabinoxylan substrate. This is confirmed by results in the hydrolysis of this substrate (FIG. 6), where compared to the GH10 xylanase CaXyn30A, it effectively converted the entire starting amount of xylan to xylooligosaccharides and substituted xylooligosaccharides.

In one embodiment, the GA-independent, GH30-8 xylanases of the present invention represent eight (upon last count) nonredundent sequences out of several hundred in the UniProt protein database which through primary sequence analysis are confidently classified as GH30-8 xylanases. However, in each of these the sequence of the β7-α7 and β8-α8 loop regions is completely different than the canonical sequence found in the GA-dependent GH30-8 xylanases and also unique within the disclosed subset as described above. These enzymes are shown to have a loss of GX substrate specificity and expanded function, providing unique xylan hydrolysis product profiles relative to the GA-dependent GH30-8 xylanases and xylanases from other xylanase enzyme families and also are proposed to be more efficient in the hydrolysis of highly substituted polymeric xylan for reasons presented throughout.

In one embodiment, by “GA-independent GH30-8 xylanases” we mean the isolated enzymes having xylanase activity comprising amino acid sequences from Clostridium (UniProt accession numbers G7M3Z8, MINOD3, Q97TI2 {Q97}, F7ZYN8 and FOKEL6); the southern root knot nematode Meloidogyne incognita (UniProt accession numbers COIQA1 and COIQA2); the plant pathogenic nematode Radopholus simitis (UniProt accession number B3TJG3); the bacterium Paludibacter propionicigenes (UniProt accession number E4T705); the bacterium Mucilaginibacter paludis (UniProt accession number H1YFT8); and from Clostridium papyrosolvens (UniProt accession number F1TBY8 {C7I}).

In one embodiment, the GH30-8 enzymes are CpXyn30A (also referred to throughout as C7I, XynC7I and CpXynC7I) and CpXynQ97 (also referred to throughout as Q97 and XynQ97 and CaXynQ97).

By “isolated enzyme” we mean polypeptides isolated from other cellular proteins, purified and recombinant polypeptides, cellular material, viral material, chemical precursors or other chemicals.

TABLE 1 Comparison of amino acid identity levels of the GH30-8 GA- independent xylanases of the present invention to the characterized GH30-8 GA-dependent xylanases BcXynC and EcXynA. Level of Identity to the Canonical GH30-8 Xylanases (%)¹ XynC XynA XynQ97 (UniProt: (UniProt: (UniProt: 45070) Q46961) Q97TI2)² Q45070_Bsubtilis (BsXynC) 100 40.4 40.9 C0IQA1_Mincognita 34.3 34.6 38.9 B3TJG3_Rsimilis 38.8 45.0 47.2 E4T705_Ppropionicigenes 37.5 41.6 42.1 (PpXyn30A)² H1YFT8_Mpaludis (MpXyn30A)² 38.2 35.4 39.7 G7M3Z8_Clostridium sp. 43.6 43.5 65.6 M1N0D3_Csaccharoperbu- 40.1 43.5 68.5 tylacetonicum Q97T12_Cacetobutylicum 40.9 40.4 100 (CaXynQ97)^(2,3) F1TBY8_Cpapyrosolvens 54.5 36.1 49.6 (CpXynC7I)² Q46961_Dehrysanthemi (EcXynA) 40.9 100 40.4 ¹Comparative analysis was performed with the sequence shuffling tool PRSS available at http://fasta.bioch.virginia.edu/fasta_www2/fasta_www.cgi?rm=shuffle. ²XynQ97, XynC7I, PpXyn30A and MpXyn30A are the four GH30-8 subset xylanases being used to represent the disclosed GH30-8 xylanases of the present invention in this application and are included for comparative reasons. ³Sequnces with UniProt accession numbers F7ZYN8 and F0KEL6 are not included as they are 100% identical to UniProt accession number Q97TI2 and are therefore redundant.

Compositions Comprising the GH30-8 Xylanases of the Present Invention.

The present invention provides a composition comprising an effective amount of at least one GA-independent GH30-8 xylanase that is capable of breaking down lignocellulose material. The enzyme composition of the invention may comprise a multi-enzyme blend, comprising more than one enzymes or polypeptides of the present invention. The GH30-8 xylanase subset composition of the invention can suitably include one or more additional enzymes derived from other microorganisms, plants, or organisms. Synergistic enzyme combinations and related methods are contemplated. One skilled in the art can readily identify the optimum ratios of the GH30-8 enzymes to be included in the enzyme compositions for degrading various types of lignocellulosic materials to contribute to efficient conversion of various lignocellulosic substrates to their constituent fermentable sugars in the case of conversion of polymeric sugars to monomers and to the desired oligomeric xylooligosaccharide mixture if that is desired. Assays known to the art may be used to identify optimum proportions/relative weights of the GH30-8 xylanases in the enzyme compositions, with which various lignocellulosic materials are efficiently hydrolyzed or broken down in saccharification processes.

In one embodiment, the invention comprises a composition comprising an effective amount of at least one of the novel GA-independent GH30-8 xylanases of the present invention. By “effective amount” we mean an amount sufficient to catalyze or aid the digestion or conversion of hemicellulose materials in lignocellulosic polysaccharide containing substrates to fermentable sugars or to a desired xylooligosaccharide composition and or to obtain a desired quality in the remaining xylan containing materials providing the full or partial removal of xylan. In one embodiment, an “effective amount” comprises the amount required to convert polymeric xylan, under ideal conditions, to the limit xylooligosaccharides, with depletion of the polymer in a given period of time. In one example, the combined weight of the novel GH30-8 xylanase subset of the present invention having xylanase activity as measured by HPLC or biochemical reducing sugar assays can constitute about 0.05 wt. % to greater than 99 wt. % (e.g., about 0.05 wt. % to about 70 wt. %, about 0.1 wt. % to about 60 wt. %, about 1 wt. % to about 50 wt. %, about 10 wt. % to about 40 wt. %, about 20 wt. % to about 30 wt. %, about 2 wt. % to about 45 wt %, about 5 wt. % to about 40 wt. %, about 10 wt. % to about 35 wt. %, about 2 wt. % to about 30 wt. %, about 5 wt. % to about 25 wt. %, about 5 wt. % to about 10 wt. %, about 9 wt. % to about 15 wt. %, about 10 wt. % to about 20 wt. %, etc) of the total proteins in the enzyme composition.

In one embodiment, the enzyme compositions desirably comprise mixtures of 2 or more, 3 or more, 4 or more, or even 5 or more GH30-8 xylanases of the invention as defined above that can catalyze or aid the digestion or conversion of hemicellulose materials to a desired oligosaccharide mixture, their final xylooligosaccharide mixture or to fermentable sugars. It is expected that members of the GH30-8 subset, may function synergistically with xylanases of other enzyme families (including the GA-dependent GH30-8 xylanases), to more efficiently degrade xylan. Suitable xylanases include those having at least equal to or greater than 30% (e.g., at least about 30%, 35%, 40, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identity to SEQ ID NOs: 1-4, over a region of at least about 0.10 (e.g., at least about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400) residues.

The GH30-8 xylanase composition of the present invention may also comprise an effective amount of at least one of the novel GH30-8 enzymes of the present invention and at least a second additional enzyme having enzymatic activity. For example, the composition may include enzymes having xylosidase activity, cellobiohydrolase activity, β-glucosidase activity, cellulase activity, β-xylosidase activity, arabinofuranosidase activity, lytic polysaccharide monooxygenase activity, lyase activity or endoglucanase activity.

The GH30-8 xylanase compositions of the present invention can suitably further comprise one or more accessory proteins, such as, for example and without limitation, mannanases such as endomannanases, exomannanases, and 6-mannosidases; galactanases such as endo- and exo-galactanases; arabinases such as endo-arabinases and exo-arabinases; ligninases; amylases; α-glucuronidases; proteases; esterases such as ferulic acid esterases, acetyl xylan esterases, coumaric acid esterases or pectin methyl esterases; lipases; other glycoside hydrolases; xyloglucanases; CIP1; CIP2; swollenins; expansins; and cellulose disrupting proteins such as cellulose binding modules; other xylanases, pectate lyases and arabinofuranosidases; stabilizers known to the art. Additives include, without limitation, any combination of sugars (e.g. maltose, glycerol), sugar alcohols (e.g. sorbitol), detergents (usually nonionic) or thickeners and cryoprotectants (e.g. glycerol, propylene glycol, polyethylene glycol).

The other enzymes or proteins of the composition of the current invention can be isolated or purified from a naturally-occurring source, or expressed or overexpressed by a recombinant host cell. They may be added to an enzyme composition in an isolated or purified form. They may be expressed or overexpressed by a host organism or host cell as part of a culture mixture, for example a fermentation broth.

The GH30-8 enzyme compositions of the present invention are used or are useful for producing metabolizable simple sugars in conjunction with other xylan accessory enzymes, such as α-glucuronidases, α-arabinofuranosidases, esterases and xylosidases. The GH30-8 subset xylanases of the present invention, by themselves will generate xylose only as a nonspecific low rate side reaction of an already small xylooligosaccharide. The xylooligomeric sugar mixture produced by these enzymes can best be viewed in the TLC data of the Q97, C7I, PpXyn30A and MpXyn30A proteins (FIGS. 6, 10A-B and 20).

The GH30-8 enzyme compositions of the present invention are also used or are useful for reducing hemicellulosic polymers and cellulosic polymers (when synergistically applied with cellulase enzymes or lignocellulose or cellulase disrupting proteins such as CBMs, expansins or swollenins or lytic polysaccharide monooxygenases) into metabolizable carbon moieties. The enzyme composition is suitably in the form of a product of manufacture, such as a formulation, and can take the physical form of a liquid or a solid.

Methods of Synthesis.

As described in the examples below, protein expression and purification scheme is done as known in the art. The cells are grown in preparation of protein expression and during protein expression and are optimized as necessary to increase protein yield. Such changes that might be considered for enhanced protein expression include: the specific treatment of the cells during the inoculant growth, the specific conditions of the inoculation such as the quantity of cells used, the starting amount of antibiotic selection, culture growth (and/or protein expression) temperature, the OD 600 nm measure when induction begins, whether IPTG or lactose is used to induce expression of the lac operator, The concentration of the inductant, oxygen availability during growth and more importantly during protein expression, and the length of time provided for expression. The current systems of expression are primarily based upon guidance provided by the pET System manual from Novagen (10^(th) Edition) for use with IPTG/lactose inducible lac operators in the Gram-negative bacterium Escherichia coli. The protein coding region for these GH30-8 GA-independent xylanases of the present invention might also be expressed from any number of other inducible expression vectors or even nuninducible, leaky vectors. Other vectors include expression that is inducible with other sugars such as arabinose, mannose or other chemicals or those that are responsive to physical changes such as exposure to any wavelength of light, temperature change or chemical shock.

Using any of the protein expression systems described above, the methods employed for protein expression could also be altered to perform as an autoinduction system. Yields of these enzymes might also increase through the use of other protein expression hosts such as, but not limited to other bacteria, yeast, fungi, plants and insects. Although not expected to increased yields, these enzymes might also be produced through in-vitro synthesis or through purification of the desired enzyme from the native source organism.

METHODS OF USE

The GH30-8 xylanases and compositions of the present invention can be applied in any industry for the reduction of xylans to xylooligosaccharides and novel mixtures of substituted xylooligosaccharides or for the production of desired product characteristics resulting from the removal, partial removal, limited disruption or modification of xylan fraction. By “xylans” we mean a β-1,4-linked xylose polysaccharide which is the primary hemicellulose of hardwoods and crop residues and the second most abundant carbohydrate polymer in lignocellulosic biomass. Other forms of this hemicellulosic polysaccharide can also be found in grain derived food products and in fruit products. The source of the xylan polysaccharide typically defines its chemical characteristics in terms of chain length and sugar and non-sugar substitutions along the xylan chain. The nature of the substitutions along the xylan chain define various xylan types including, glucuronoxylan, acetylglucuronoxylan, acetylglucuronoarabinoxylan, glucuronoarabinoxylan and arabinoxylan, all generally referred to as “xylans”.

The GH30-8 xylanase and compositions thereof of the present invention can be used for hydrolyzing, breaking up, or disrupting all xylans or xylan-comprising compositions. In one embodiment, the method comprises contacting the xylan or xylan-comprising composition with the GH30-8 GA-independent subset of xylanases or enzyme composition of the present invention under suitable conditions, wherein the GH30-8 subset or enzyme composition of the present invention hydrolyzes, breaks up or disrupts the xylan or xylan-comprising composition.

The GH30-8 xylanases of the present invention and compositions thereof used in such a process may comprise, for example, a 0.1 g to 100 g (e.g., 2 g to 20 g, 3 g to 7 g, 1 g to 5 g, or 2 g to 5 g) of polypeptides having xylanase activity per kg of hemicellulose in the biomass material. The GH30-8 xylanases of the present invention and compositions thereof may be applied in conjunction with other enzymes for complete enzymatic degradation of xylans to xylan-constituent monosaccharides or by itself to produce complex xylooligosaccharide mixtures or otherwise facilitate the processing of the xylan fraction of lignocellulosics.

The GH30-8 xylanase and compositions of the present invention can also be used to digest xylans from any source, including all biological sources, such as plant biomasses, including, but not limited to, corn, grains, sugarcane, grasses (Indian grass, such as Sorghastrum nutans; or, switchgrass, e.g., Panicum species, such as Panicum virgatum), perennial canes (e.g., giant weeds), woods or wood processing byproducts, e.g., in the wood processing, pulp and/or paper or nanocellulose and nanofibrilated cellulose industry, in textile manufacturing, in household and industrial cleaning agents, and in biomass waste processing; for the processing or preparation of dough or bread based products and in animal feed for application to enhance animal nutrition and feed digestion and possibly for the synthesis of larger xylooligosaccharides as exemplified in Example 1.

The GH30-8 xylanases of the present invention and compositions thereof (including enzymes or designed enzyme compositions) can also comprise at least one biomass material. By “biomass material” we mean any material comprising a lignocellulosic material derived from an agricultural crop or byproduct of a food or feed production. Suitable biomass material can also include lignocellulosic waste products, waste paper or waste paper products, plant residues comprising grains, seeds, stems, leaves, hulls, husks, corncobs, corn stover, grasses, straw, reeds, wood, wood chips, wood pulp, or sawdust. Exemplary grasses include, without limitation, Indian grass or switchgrass. Exemplary reeds include, without limitation, certain perennial canes such as giant reeds. Exemplary paper waste include, without limitation, discarded or used photocopy paper, computer printer paper, notebook paper, notepad paper, typewriter paper, newspapers, magazines, cardboard and paper-based packaging materials.

The GH30-8 xylanase of the present invention and compositions thereof (including enzymes or designed enzyme compositions, such as products of manufacture or a formula) are useful for hydrolyzing hemicellulosic materials, catalyzing the enzymatic conversion of suitable biomass substrates to mixtures of complex oligomeric sugars or fermentable simple sugars.

Methods of using or applying the GH30-8 xylanases and compositions thereof in a research setting, an industrial setting, or in a commercial setting are also provided. The GH30-8 xylanases of the present invention and compositions thereof may be added as a desired mass of dry powder or as a desired volume of concentrated or diluted aqueous or non-aqueous solution to xylan containing materials incubated at a desired temperature, which may include temperatures which are optimal or not optimal (could be too hot (slow or fast inactivation through denaturation) or too cold (non-optimal activity)) for the GH30-8 xylanases of the present invention. The application of the GH30-8 xylanases of the present invention and compositions thereof may continue until such time as the desired outcome is achieved. For application as an additive to animal feed, the designated amount of GH30-8 xylanases of the present invention or compositions thereof will be added to, at the desired proportion, a xylan containing biomass material or to a non-xylan containing material intended for animal consumption. Following addition, this animal feed material will be processed in a manner consistent with significant or acceptable GA-independent GH30-8 xylanase activity recovery for the anticipated application within the animals digestive tract.

It is also considered that these GH30-8 xylanases of the present invention and compositions thereof may be applied through surface treatments of biomass products and items for the alteration of wood surface physical, chemical or textural properties. Using the GH30-8 xylanases and compositions of the present invention to remove or reduce xylans in biomass preferably yields 50% to 90% xylobiose and an array of xylooligosaccharides of the enzyme accessible xylan.

In addition to reducing xylans in biomass to xylooligosaccharides and sugars, the GH30-8 xylanases and compositions of the present invention can be used in industrial, agricultural, human food and animal feed, as well as a human food and animal feed supplementation. There is an ever increasing interest for the use of lignocellulosic biomass to make products and fuels to increase our use of renewable resources, and for reduction of greenhouse gas emissions. The GH30-8 xylanases and compositions of the present invention may find applications in any lignocellulose based process in which the xylan component is treated for removal, or for subsequent use in the form of xylooligosaccharides, substituted xylooligosaccharides, oligosaccharides of other polymeric sugars or monosaccharides of any lignocellulose derived sugar. For instance, the GH30-8 xylanases and compositions of the present invention may find applications in wood, paper and pulp treatments, treating fibers and textiles, treating foods and food processing, animal feed supplementation and food or feed or food additives, reducing the mass and volume of substantially untreated solid waste, detergent, disinfectant or cleanser (cleaning or cleansing) compositions. There is also interest for use of such enzymes in processing of dough and in the preparation of other foods and beverages and in the preparation of prebiotics from both cereal grains and lignocellulosic biomass as food additives and nutraceuticals.

While multiple embodiments are disclosed, still other embodiments of the present invention will become apparent to those skilled in the art from the following detailed description. As will be apparent, the invention is capable of modifications in various obvious aspects, all without departing from the spirit and scope of the present invention. Accordingly, the detailed description of the novel compounds and methods of the present invention are to be regarded as illustrative in nature and not restrictive.

III. Examples

The invention will be more fully understood upon consideration of the following non-limiting Examples. The invention has been described in connection with what are presently considered to be the most practical and preferred embodiments. However, the present invention has been presented by way of illustration and is not intended to be limited to the disclosed embodiments. Accordingly, those skilled in the art will realize that the invention is intended to encompass all modifications and alternative arrangements within the spirit and scope of the invention as set forth in the appended claims.

Example 1. Structural and Biochemical Characterization CpXyn30A

In this example, we present the structural and biochemical characterization of a novel enzyme that possesses a high degree of amino acid identity to the canonical GH30-8 enzymes, but lacks the hallmark β8-α8 loop region which defines the GH30-8 subfamily of xylanases. The three-dimensional structure of this unique GH30 subfamily 8 homolog was determined using x-ray crystallographic methods and provide functional characterization of the enzyme with comparisons to the canonical GH30 subfamily enzyme XynC from Bacillus subtilis (BsXynC).

In light of the role of the β8-α8 loop region in imparting functional specificity to the GH30-8 subfamily, amino acid sequence studies were implemented to identify homologs which possess sequence differences in this region. A putative xylanase (UniProt ID: FITBY8; referred to as xylanase 30A) derived from the lignocellulose degrading bacterium Clostridium papyrosolvens (CpXyn30A) was identified and chosen for study.

DNA Synthesis and Protein Expression.

The sequence for UniProt ID: C7MC9 (also known as F1TBY8) was identified for this study and the expression-optimized coding sequence (Welch et al., 2009) including an C-terminal HisTag was ordered from DNA 2.0 (Menlo Park, Calif.) in the kanamycin-resistant pJexpress 411 expression vector. Plasmid DNA was used to transform Escherichia coli for expression. Cells were grown with shaking at 37° C. in Luria-Bertani broth supplemented with 0.03 mg/mL kanamycin until they reached an optical density at 600 nm of 0.6. The cells were induced by the addition of isopropyl-β-D-thiogalactopyranoside to a final concentration of 0.5 mM and incubated with shaking at 250 RPM for five hours at 37° C. Cells were harvested by centrifugation at 7500 RPM for 20 minutes at 4° C. The resulting pellet was suspended in 50 mM sodium phosphate, 100 mM NaCl, pH 7.2 at a ratio of 5 mL per gram of cell pellet. A 1 μL aliquot of 1× Halt protease inhibitor (Thermo Fisher, Rockford, Ill. USA) was added for every 1 mL of buffer used. Suspended cells were lysed using sonication and the lysate was centrifuged at 11,000 RPM for 30 minutes at 4° C. The resulting supernatant was dialyzed overnight at 4° C. against 50 mM sodium phosphate, 500 mM NaCl, 10 mM imidazole, pH 7.2, using 10,000 Da MWCO dialysis tubing.

Purification.

The post dialysis material was centrifuged to remove any precipitate and then filtered through a 0.22 m filter to further remove debris. The filtered solution was then loaded onto a 1 mL HisTrap fast flow column (GE Healthcare Life Sciences Pittsburgh, Pa.) charged with Ni²⁺ and the hexahistidine-tagged recombinant protein was eluted with a linear gradient of 0-500 mM imidazole in 50 mM sodium phosphate, 250 mM NaCl, pH 7.2. Peak fractions were analyzed by SDS-PAGE (Laemmli, 1970). Fractions containing the protein were combined and dialyzed against 50 mM sodium phosphate, 250 mM NaCl, 10 mM imidazole, pH 7.2, overnight at 4° C. using 10,000 Da MWCO dialysis tubing. A second purification step using a 1 mL HisTrap column charged with Co²⁺ was then employed with the same elution scheme as above. A single large peak was obtained from this run and peak fractions were analyzed by SDS-PAGE to ascertain purity and size. Fractions containing the purified protein were combined and dialyzed into 20 mM HEPES, 150 mM NaCl, pH 7.2. In a separate preparation meant solely for biochemical studies, CpXyn30A was purified as described, but the final purified protein was dialyzed against 30 mM Tris HCl, 50 mM NaCl, pH 7.5. After dialysis, the protein solution was concentrated to at least 5 mg/mL and stored at −80° C. until crystallization or functional studies. The GH30-8 xylanase, BsXynC, was purified as previously described (St. John et al., 2011; St. John et al., 2006). Protein concentration was routinely determined using absorbance at 280 nm with the ProtParam predicted extinction coefficient (Gasteiger et al., 2005).

Crystallization.

The sparse matrix screens Crystal Screen, Crystal Screen 2, Index and PEG/Ion as well as the Sodium Malonate, and Ammonium Sulfate Grid Screens (Hampton Research, Aliso Viejo, Calif.) were used for the initial screening. Sitting-drop vapor-diffusion experiments were performed in 24 well microplates (Art Robbins Instruments, Sunnyvale, Calif.). Each well contained 300 μL of precipitant solution and drops were set using 1 μL of protein solution and 1 μL of precipitant. The plates were sealed with sealing film and incubated at 25° C. A crystallization condition of 0.1 M ammonium acetate, 0.1M Bis-Tris pH 5.5 and 17% polyethylene glycol 10,000 was found using the Hampton Research Index screen. Sitting-drop vapor-diffusion experiments were performed using this condition at protein concentrations of 6.5 mg/mL and 10 mg/mL and cubic crystals (0.5 mm on edge) were obtained in the drops containing CpXyn30A at a concentration of 6.5 mg/mL. Crystals were harvested for cryocrystallographic data collection by transferring them stepwise to solutions containing 5% (v/v), 10% (v/v) and 20% (v/v) glycerol in well solution. After the 20% glycerol transfer, the crystals were flash cooled and stored in liquid nitrogen until data collection.

Data Collection, Analysis and Model Building.

Data was collected on a Rigaku RU-H3R copper rotating anode generator, operating at 50 kV and 100 mA, fitted with Confocal Maxflux™ optics (Osmic Inc., Troy, Mich.) and a Rigaku R-Axis IV+ image plate detector. A 180° dataset was collected with 5 minute exposure times and a Phi oscillation of 0.5 degrees per image. The resulting data was processed to 2.01 Å and the crystal belonged to the orthorhombic space group, C222₁ with unit cell parameters: a=66.0, b=76.5, c=150.6 and a=β=γ=90.00. Data were indexed and integrated in iMosflm (Battye et al., 2011), scaled in SCALA (Evans, 2005), and initial molecular replacement phases, electron density map calculation and model building was performed with the programs Phaser (McCoy et al., 2007), Phenix (Adams et al., 2010) and Coot (Emsley et al., 2010), respectively. The final model (PDB code: 4FMV) was studied and figures prepared using PyMOL (DeLano, 2002).

Biochemicals and Assays.

All reagents were of the highest purity available. Xylooligosaccharides xylobiose (X2) and xylotriose (X3) were purchased from WAKO Chemicals (Richmond, Va.) and xylotetraose (X4), xylopentaose (X5) and xylohexaose (X6) were purchased from Megazyme International (Wicklow, Ireland). Concentrations of xylooligosaccharide standards were determined with the phenol-sulfuric total carbohydrate assay (Dubois et al., 1956). The aldouronate, aldopenturonate (GX4), with a GA residue substituted penultimate to the nonreducing terminus of xylotetraose was the aldouronate limit product of a GH11 xylanase (Biely et al, 1997) (Trichoderma longibrachiatum, XynII, Hampton Research, Aliso Viejo, Calif.) and was purified using a 1.7 m P-2 resin column (Bio-Rad, Hercules Calif.) in 50 mM formic acid. The Rotovap concentrated sugar was then loaded onto the same column equilibrated with water to remove the formic acid from the oligosaccharide. The desalted GX4 was lyophilized, dissolved in water and the concentration determined with the Blumenkrantz assay for total uronic acid content (Blumenkrantz & Asboe-Hansen, 1973).

Enzymatic Activity Measurements and Hydrolysis Product Studies.

Activity measurements on polymeric substrates were determined through reducing end quantification with the Nelson's Test (Nelson, 1944) as has been previously described (St. John et al., 2006). Conditions for hydrolysis by CpXyn30A were optimized using beech wood xylan (Sigma-Aldrich Corporation St. Louis, Mo.) in acetate buffers ranging in pH from 3-6. Thermostability was analyzed using enzyme pre-incubations at a range of temperatures from 4°-50° C. followed by activity assessment at 30° C. Activity measurements for functional comparison were performed using sweet gum wood glucuronoxylan (SGX) (kindly provided by James F. Preston from the University of Florida) and wheat arabinoxylan (WAX) (Megazyme International). Hydrolysis of xylooligosaccharides was determined using an Agilent 1260 HPLC (Agilent Technologies, Santa Clara, Calif.) with resolution of neutral xylooligosaccharides performed using a Phenomenex RNO column (Phenomenex Torrance, Calif. USA) with water as eluent at 0.3 ml/min flow and 75° C. or a Shodex SH1821 (Showa Denko America, New York, N.Y.) in 0.05% H₂S0₄ running at 0.8 ml/min and 75° C. In both cases, the refractive index of the eluate was monitored throughout the separations.

Hydrolysis reactions were performed in 25 μL volumes under optimized conditions (100 mM sodium acetate, pH 4.5, at 30° C.) with xylooligosaccharides at 12.1 mM; a concentration (of X6) that approximated that of the polysaccharide used in the reactions employing polymeric xylan substrates. Reactions were stopped by boiling the samples in a water bath for 5 minutes. For studies evaluation p-nitrophenol (pNP) conjugated xylooligosaccharides (pNPXn, where n=the number of xylose units) samples were injected onto an Agilent 1260 HPLC (Agilent Technologies, Santa Clara, Calif.) equipped with a Zorbax C8 column (Agilent Technologies) and were eluted with a 0-90% acetonitrile gradient in water (Eneyskaya et al., 2003) and absorbance of the eluate was monitored at 302 nm. All HPLC analyses were performed in triplicate using 5 μL injections. Thin layer chromatography (TLC) was performed as described previously (St. John et al., 2006; Bounias, 1980).

Amino Acid Sequence Studies.

Sequences were aligned with the program MAFFT (Katoh & Toh, 2008) and the alignment figure was generated with ESPript (Gouet et al., 2003). Domain prediction was done using the online Conserved Domains program (Marchler-Bauer et al., 2011) and the domain representation was created using the program DOG 1.0 (Ren et al., 2009). Phylogenetic relationships were calculated and represented using the software MEGA 6.0 (Tamura et al., 2013).

Selection of CpXyn30A.

Primary amino acid sequence alignments (FIG. 7a ) identified a unique GH30-8 enzyme that did not contain the normally conserved β8-α8 loop sequence. For this enzyme, (UniProt ID: F1TBY8) sequence studies verified the existence of the two conserved glutamate amino acid side chains which catalyze the double displacement reaction common to CAZy Clan A enzymes and identified a likely secretion signal sequence in addition to non-catalytic modules positioned C-terminal of the GH30-8 catalytic module. These include a family 6 carbohydrate binding module (CBM6) for binding soluble glucan and two dockerin domains presumably for interaction within a cellulosome assembly (FIG. 7b ). Combined, these features suggest that this enzyme may have a role in the degradation of the xylan component of lignoccllulosic biomass. Phylogenetic analysis of CpXyn30A verifies the most similar enzymes to be GH30-8 homologs from Gram positive organisms (FIG. 7c ).

Based on these findings and an interest in characterizing a GH30-8 enzyme with a nonconserved amino acid sequence in the β8-α8 loop, the coding sequence for the gene was ordered from DNA 2.0 with the Signal-P (Petersen et al., 2011) predicted secretion signal sequence replaced by an amino-terminal methionine and a hexahistidine tag appended to the new carboxy-terminus defined by the end of a GH30-8 sequence alignment effectively truncating the protein sequence before the predicted (Bateman et al., 2004) CBM6 and dockerin domains.

Structure of CpXyn30A.

Refinement and model quality statistics for CpXyn30A structure model are presented in Table 2.

TABLE 2 Data collection and refinement statistics of Clostridium papyrosolvens Xyn30A. PDB code 4FMV Wavelength (Å) 1.542 Resolution range (Å) 38.30-2.01 (2.08-2.01) Space group C2221 Unit cell parameters a, b, c (Å) 66.0, 76.5, 150.6 Total reflections 46801 (4031) Unique reflections 24506 (2121) Redundancy 1.9 (1.9) Completeness (%) 94.9 (85.7) Mean I/sigmaI 27.63 (12.32) Wilson B-factor (Å²) 16.05 R-merge 0.020 (0.058) R-meas 0.028 CC* 1.00 (0.996) R-work 0.1642 (0.1534) R-free 0.2010 (0.2341) CC(work) 0.949 (0.955) CC(free) 0.928 (0.708) Number of atoms 3228 Ligands 0 Waters 212 Protein residues 386 RMS (bonds, Å) 0.013 RMS (angles, °) 1.50 Ramachandran favored (%) 97 Ramachandran outliers (%) 0 Clash score 2.71 Average B-factor (Å²) 15.20 Solvent (%) 18.10 Statistics for the highest-resolution shell are shown in parentheses.

As expected, the overall structure of CpXyn30A is very similar to other GH30-8 enzymes with an RMSD of just 0.95 Å (all-atoms, Pymol: align) obtained when compared to the crystallographic structure of the canonical Gram-positive GH30 xylanase, BsXynC (FIG. 8a ). However, the structure of CpXyn30A in the β8/α8 catalytic core domain is notably different from BsXynC (PDB code: 3KL5) in the β1-α1, β2-α2 (FIGS. 8b & 8 c), and, most importantly in the CpXyn30A unique β7-α7 and β8-α8 loop regions (FIGS. 9a & 9 b).

In the β1-α1 loop region, the sequence of CpXyn30A is shorter by three amino acids compared to BsXynC and the Gram-negative bacterial GH30-8 enzyme, XynA, from Erwinia cluysanthemi (EcXynA). A conserved tryptophan residue (Trp25) positioned by this loop establishes a predicted −3 xylosyl binding subsite (St. John et al., 2011; Urbanikova et al., 2011). While the Cα position for this conserved tryptophan is nearly identical in both BsXynC and EcXynA, the position of Trp25 in CpXyn30A is shifted 3.4 Å towards the inside portion of the loop. In either case, the indole side chain of the tryptophan lies in a similar position in all three enzymes (FIG. 17). As expected it is unique in the β7-α7 and β8-α8 loop regions relative to BsXynC and also, as expected, unique compared to CpXyn30A

The β2-α2 loops of BsXynC and EcXynA are both very similar but the analogous region of CpXyn30A is considerably larger due to the presence of an additional nine amino acids in the α2 helix. In all three enzymes, a single aromatic amino acid stacking interaction is observed between residues in helices α2 and α3 (FIG. 8b ). The conserved interaction shared by these three xylanases consists of a phenylalanine (BsXynC and EcXynA) or tyrosine (Tyr124 in CpXyn30A) from the α3-helix in a perpendicular stacking arrangement with a tryptophan (Trp66 in CpXyn30A) extending from the α2-helix. The extended α2 helix unique to CpXyn30A provides two additional intramolecular contacts with adjacent regions of the enzyme.

In the first interaction, Trp58 of CpXyn30A overlays the α3-helix and stacks perpendicular to Tyr120 (FIG. 8b ). The second interaction found in the extended loop region is a hydrogen bond between Asp56 and Ser93 of the β4-α4 loop region. These last two contacts are not found in the BsXynC or EcXynA enzymes and may serve a role in supporting the beta-structured β3-α3 loop region as originally described in BsXynC (St John et al., 2011), but not in EcXynA.

A structural difference between GH30-8 enzymes from Gram-negative bacteria and those from Gram-positive bacteria (St. John et al, 2011) is found in the β3-α3 and β4-α4 loop regions (FIG. 8c ). In CpXyn30A, this region adopts a fold similar to the Gram-positive GH30-8 enzymes (BsXynC-like) with a small β-structure extending upward at the top of the β3-α3 loop and a region in the β4-α4 loop which supports this extended β-structure primarily through stacking interactions. This is in contrast to the Gram-negative homologs of these enzymes (EcXynA like) which lack the extended β3-α3 loop β-structure and instead relies solely on hydrogen bonding between the two loop regions for stabilizing contacts. These hydrogen bonds are not present in the Gram-positive examples of these enzymes (St. John et al., 2011).

The β7-α7 loop region (FIG. 9) of CpXyn30A also displays a significantly different structure relative to the BsXynC and EcXynA enzymes. In this loop region of BsXynC and EcXynA, two conserved amino acids (Tyr231 and Ser235 in BsXynC) establish hydrogen bonds with the C-2 and C-3 hydroxyl groups of the α-1,2-linked GA appended on the xylan chain (FIG. 9a ).

In CpXyn30A, this loop is smaller in size, but still has the conserved tyrosine (Tyr234). Following this amino acid, the loop region diverges slightly from the typical structure with Asp236 in place of a normally conserved serine and is positioned as to make a functionally similar contact unlikely. Despite this difference, it may be considered possible that the O-2 hydroxyl of sugars linked α-1,2 to the xylose in this subsite (typically GA) may hydrogen bond with Tyr234 as observed in the ligand bound crystal structures of BsXynC (FIG. 3a ) and EcXynA (St. John et al., 2011, 291; Urbanikova et al., 2011).

In the altered sequence of the β8-α8 loop, four of the GA coordinating contacts identified for the ligand bound BsXynC structure are no longer available (FIG. 3b ) (St. John et al., 2011). Surprisingly, despite the fact that the sequence of the β8-α8 loop region of CpXyn30A is completely different from the conserved sequence found in BsXynC and EcXynA, the structure of the loop does not significantly deviate from the Ca-trace of these model enzymes. This is most noteworthy since this region forms the basis for classification of the proteins into the GH30-8 subfamily due to its importance in GA recognition (FIG. 3b ).

Functional Characterization.

While CpXyn30A has measurable activity on glucuronoxylan, the specific activity is low relative to the characterized GH30-8 xylanases as well as other more common β-1,4-endoxylanases such as those from families GH10 and GH11. In consideration of this finding, other polymeric substrates were tested for activity. These included carboxymethylcellulose, barley β-glucan, yeast glucan, glucomannan, galactoglucomannan, xyloglucan and gum arabic, but in each case there was no detectable activity. The results presented in Table 3 indicate that CpXyn30A displays similarly low specific activity on all xylan substrates tested.

TABLE 3 Specific activity¹ comparison of CpXyn30A and BsXynC on xylans and xylooligosaccharides. Substrates Concentration CpXyn30A BsXynC Sweetgum 10.00 mg/ml 1.1 ± 0.1  70.7 ± 4.8 glucuron- 7.50 mg/ml 1.1 ± ≤0.1 61.7 ± 3.8 oxylan (SGX) Wheat 7.50 mg/ml 1.7 ± 0.2  nd² arabinoxylan (WAX) Xylo- 12.10 mM 1.19 ± ≤0.01     0.019 ± ≤0.002⁵ hexaose (X₆)⁴ Xylo- 12.10 mM 0.36 ± ≤0.01 ND³ pentaose (X₅)⁴ ¹Units/mg protein, where one Unit is defined as one μmole/minute of activity. Data results from triplicate measurements resulting from a single assay. These results were consistent with numerous previous analyses. The given error is represented by the standard deviation. ²nd = Not detected ³ND = Not determined ⁴The data for these substrates represent an evaluation of specific activity based solely on the decrease of substrate. These values are higher than the true specific activity as the described competing transglycosylation reaction presumable consumes two X₆ molecules. Xylohexaose was digested for 8 minutes and xylopentaose was digested for 20 minutes. ⁵The X₆ substrate concentration was only 10 mM for this reaction, a difference in the comparison which is considered inconsequential to this study.

Interestingly, specific activity was 57% greater on WAX than on SGX when measured at the same substrate concentration (7.5 mg/ml). Studies employing X6 as a substrate at 12.1 mM (roughly the molar equivalence of 10 mg/ml xylan) show that CpXyn30A exhibits a similar activity as with 10 mg/ml SGX a characteristic not previously observed for other GH30-8 enzymes (see below).

In TLC analysis of an overnight hydrolysate of SGX by CpXyn30A, X2, X3, X4 and the primary aldouronate, GX4 (aldopentauronate, FIG. 10A) were observed. However, without further studies, the configuration (i.e. GA substitution position) of this aldouronate product is unknown.

CpXyn30A also efficiently processed WAX with only low levels of X2 and X3 apparent following an overnight digestion, but numerous other spots were observed on the plate which did not align with any of our standards. This suggests they are arabinofilranose substituted xylooligosaccharides instead of neutral oligoxylosides. Hydrolysis of X6 and X5 resulted in a distribution of smaller xylooligosaccharides similar to those observed for glucuronoxylan hydrolysis. There was only slight hydrolysis of X4 observed (FIG. 10B) and no detectable hydrolysis of X3 in overnight reactions (FIG. 10B).

Activity measurements of BsXynC confirm the reported function of this enzyme as a glucuronoxylan xylanohydrolase which requires a substitution of α-1,2-linked GA residues for activity. Multiple attempts were made to obtain activity measurements for the hydrolysis of WAX by BsXynC, including one attempt which used a 10-fold greater amount of enzyme than that used in similar reactions employing CpXyn30A and an overnight reaction time. However, all results were generally too variable and close to zero to be reported as anything other than ‘not detected’ (Table 3).

In agreement with previous findings, it is observed that BsXynC activity on X6 (10 mM) was 3-orders of magnitude (6172 fold) lower than on SGX (at 7.5 mg/ml) (Urbanikova et al., 2011), supporting the requirement for the GA appendage for activity.

Parallel TLC studies of the reaction products generated by BsXynC also confirm our current understanding of these enzymes and have provided further insight to their specificity (FIGS. 10A & 10B). Hydrolysis of SGX by BsXynC yielded an array of aldouronate sugars (St. John et al., 2006; Vrsanska et al., 2007) while reactions containing WAX as substrate did not yield any detectable smaller sugars, a result supported by the lack of detectable enzymatic activity on this substrate in kinetic studies (Table 3). The TLC analysis of BsXynC hydrolysis of X6 visually confirms the results presented in Table 3 with a very low activity observed for hydrolysis of this substrate. In contrast to these observations, overnight hydrolysis of GX4 by BsXynC resulted in xylose and the smaller aldouronate, aldotetrauronate (FIG. 10B).

Xylooligosaccharide hydrolysis studies showed that CpXyn30A has a competing transglycosylase activity (Shaikh & Withers, 2008). For this to occur, following the nucleophilic attack on the anomeric carbon by the catalytic nucleophile (E232 in CpXyn30A) an enzyme-substrate complex is formed which, in retaining glycosyl hydrolase enzymes (such as CpXyn30A), typically resolves by release of the sugar from the enzyme through a water molecule activated by the other member of the catalytic acid/base pair (E143 in CpXyn30A) (Shaikh & Withers, 2008). This accepted, double-displacement reaction scheme generates smaller sugars from polymeric substrates. However, if a sugar molecule were to bind into the active site cleft instead of a water molecule, then the C-4 hydroxyl group of the non-reducing terminal residue may become activated, resulting in a transglycosylation reaction creating a new β-1,4-xylosidic bond instead. For retaining glycosyl hydrolases like GH30 enzymes, transglycosylation may occur as a product of a failed hydrolytic reaction.

Here, reaction mixtures consisting of X6 as the most abundant substrate were employed to probe the transglycosylase activity of CpXyn30A. If X6 binds so that it will be hydrolyzed into two molecules of X3, and a second X6 reoccupies the other half of the active site cleft, then the condensation of these two sugars will result in the formation of the xylooligosaccharide xylononaose (X9) (FIG. 11, inset). Because an initial hydrolytic activity is required to observe transglycosylation, the net reaction proceeds toward the right (smaller xylooligosaccharides) due to the eventual buildup of limit products which do not act as substrates for further endo-hydrolysis.

In the present study, hydrolysis of X6 by CpXyn30A results in what is predicted from HPLC chromatograms as xylodecaose (X10) and X9 as well as smaller xylooligosaccharides such as X2 through X4 (FIGS. 11 & 12). These data were confirmed by TLC analysis which showed that within 20 minutes of the start of the reaction, hydrolysis of X6 resulted in a spot with no mobility (est. DP degree of polymerization >8) and small amounts of X4, X3 and X2. Formation of X10 and X9 may occur through a transglycosylation when CpXyn30A cleaves X6 such that either a X3 or X4 is positioned in the glycone side of the substrate binding cleft in the enzyme substrate complex. Because xylooligosaccharides smaller than X5 are not hydrolyzed they cannot be a source for further transglycosylation.

Based on this analysis, specific activities (Table 3) are anticipated to be lower than reported as the enzyme catalyzed transglycosylation reaction consumes two molecules of X6. It is impossible to determine what the ratio of hydrolysis:transglycosylation reactions might be without quantification of X9 and X10.

Transglycosylation does not likely contribute substantially during initial hydrolysis of polymeric xylan as the concentration of reducing termini is much lower and unlikely to significantly compete as an acceptor through the limited reaction time. Interestingly, even though the model enzyme BsXynC only hydrolyzes neutral xylooligosaccharides such as X6 very slowly, the activity that was observed appears from TLC to resolve in part by transglycosylation, similar to CpXyn30A (FIG. 4b ). Similar results were previously reported for another GH30-8 Gram-positive enzyme (a highly conserved homolog from Bacillus sp. Strain BP-7) (Gallardo et al., 2010).

Our data on the rate of hydrolysis of X6 agrees with the previously reported level of activity of the Gram-negative GH30-8 enzymes EcXynA having 3-orders lower activity on this neutral xylooligosaccharide than on a polymeric glucuronoxylan substrate (Urbanikova et al., 2011). Transglycosylation can also be observed by TLC after the overnight digestion of GX. This indicates that BsXynC may be producing disubstituted aldouronates (FIG. 4B).

Clues as to the distinctive comparative function of these enzymes may be ascertained from their structures. Of the five hydrogen bonds and one salt bridge that have been described which establish the interaction between the β7-α7 and β8-α8 loops and the GA side chain in the BsXynC and EcXynA enzymes, only two hydrogen bonds are thought to still be possible in CpXyn30A. These positions, equivalent to Tyr234 and Trp265 in CpXyn30A may be available for hydrogen bonding with either GA or arabinofuranose substitutions linked α-1,2 on the main xylan chain. However, since activity measurements are similar on neutral xylooligosaccharides and xylans, it seems unlikely that this potential hydrogen bonding position plays a significant role in xylan hydrolysis. Instead, the β8-α8 loop contains larger, hydrophobic amino acids which would appear from inspection of a surface analysis to displace the xylose and any substitution in this position (−2 subsite). Because of this displacement, substitutions in this region most likely are beyond hydrogen bonding distance of Tyr234 and Trp265. The increased size of the β8-α8 loop may reorient the glycone bound xylan sugar out of an ideal orientation for hydrolysis.

From these data, CpXyn30A stands out as a defunct GH30-8 xylanase having no apparent specificity for O-2 linked GA substitutions and a greatly decreased specific activity on the usual glucuronoxylan substrate while simultaneously possessing a unique ability to hydrolyze WAX, SGX and the neutral xylooligosaccharide X6 at rates approximately 100-fold greater than BsXynC processing of neutral sugars. Even though CpXyn30A has a demonstrated xylanase activity, it is not known whether the enzyme represents an evolved functionality whose role has not yet been identified or a residual xylanase activity resulting from unbeneficial changes to the Xyn30A gene in C. papyrosolvans.

The data presented helps us understand the function of the GH30-8 β8-α8 loop in determination of the specificity of these enzymes. It seems clear that the conserved sequence of this loop found in the BsXynC/EcXynA enzymes may not only enable recognition of the α-1,2-linked GA appendage, but might also prevent binding of neutral sugars by physically obstructing access to the binding cleft.

Example 2: Expression and Purification of CaXynQ97

The codon optimized (for E. coli) coding sequence for CaXynQ97 including a C-terminal His-tag was synthesized by DNA 2.0 and inserted into their pJexpress 411 kanamycin selective expression vector (pCaXyn30A). Chemically competent E. coli BL21 (DE3) was transformed with the pCaXynQ97 expression vector and selected for on LB agar plates containing 50 ug/ml kanamycin. The following day a single colony was selected from the plate and inoculated into a 50 ml volume of LB media containing 50 ug/ml kanamycin contained in a 250 ml long-neck shake flask. This was grown overnight at 37° C. with shaking at 250 rpm. The following morning, 5 ml aliquots of this culture were used to inoculate several 37° C. preequilibarted 0.5 liter volumes of LB media containing 50 ug/ml kanamycin contained in a Fernbach flask. This was grown at 37° C. with shaking at 300 rpm until a measured OD 600 nm of approximately 0.7 and the culture was then induced by addition of IPTG to a final concentration of 1 mM. The induced culture was then grown for an additional 4-5 hours at 37° C. and shaking at 300 rpm. Following induction, the foil cap was kept in place for 1 hour, but removed for the remaining hours of induction. The cells were then collected through centrifugation at 10400×g (i.e., 8000 rpm in a GSA rotor) at 4° C. Each pellet resulted from 0.5 liter of expression culture. The pelleted cells were stored frozen at −80° C. until used for protein purification.

The His-tagged version of this enzyme was purified in a standard manner very similar to CpXyn30A (C7I) with use of a Ni-affinity IMAC chromatography column and subsequent gel filtration chromatography. After several years attempting to obtain protein crystals for crystallographic structure determination, we decided to reclone the CaXynQ97 enzyme from the pJexpress 411 vector into the pET28 protein expression (Novagen) with removal of the C-terminal His-tag. The new pET28 based expression construct (pCaXynQ97-nohis) transformed into E. coli BL21 (DE3) expressed very well as did the previous pJexpress construct. The preceding expression protocol and the following cell processing procedure apply to both constructs except that for the cell processing for the no-his-tag expression product was “beefed-up” with addition of lysozyme in the hope that it would make for a cleaner preparation since no affinity tag was being used. As will be explained it turned out that this purification was actually just as easy as an affinity system due to the inherent high isoelectric point of CaXyn30A.

For protein purification, four pCaXynQ97-nohis E. coli protein expression pellets were thawed at room temperature and then on ice. An EDTA-free Mini cOmplete protease inhibitor tablet (Roche) was added to one of these pellets. A volume of 8 ml of 25 mM Tris HCl pH 7.1 was added to each pellet and the soft pellets were resuspended using a glass rod and eventually, to obtain fluidity, with the action of α5 ml pipet. Resuspended pellets were combined and each of the four centrifuge tubes (250 ml volume) were rinsed with 2 ml of Tris buffer. Lysozyme was added to a final concentration of 20 ug/ml and the full volume (˜47.5 ml) was transferred to a 250 ml capacity glass sonication vial and allowed to cool on ice for 15 minutes. The sonic microtip was calibrated according to instructions and then submerged ˜1 inch below cell suspension surface. Set to 20% power (˜95 watts) and an approximate control knob setting of 3.5. Cycle twelve times of 10 seconds on, 50 seconds off in ice/water. Total process time is 12 minutes. Collect sonicate, and based on approximate volume add 1M MgCl₂ to a final concentration of 2 mM and lysozyme as added before. Add 250 Units of Benzonase (Novagen) and rock at room temperature for 30 minutes. Cell lysate volume is split into 2-45 ml Oakridge centrifuge tubes. Preequilibrate centrifuge to 15° C. and centrifuge cell lysates for 30 minutes at 15° C. at G-Force (i.e.; 14000 rpm (SS-34 rotor)). Collect supernatant cell free extract and filter through 0.45 um syringe tip filter.

The processing of both the His-Tag and no-His-Tag versions of CaXynQ97 used Tris based buffers as it was shown several times that this enzyme precipitates in the phosphate based buffers typically used for IMAC chromatography, as in the purification of CpXynC7I. The resulting cell free extract (CFE) of the CaXynQ97-nohis enzyme was fractionated on a 5 ml Econo-Pac CM column (Bio-Rad) equilibrated in pH 7.1 25 mM Tris HCl and a gradient to 500 mM NaCl. The fractionation was very clean and the eluted protein peaks were combined and concentrated using an Amicon Ultra 15 with 10K MWCO. This concentrated CaXynQ97 preparation was then desalted using a 5 ml Econo-Pac P-6 desalting column and then again concentrated with a another Amicon Ultra 15 10K MWCO centrifugal concentrator. It was noted that high concentrations of CaXynQ97-nohis did not seem to like the high salt, so prior to concentrating for subsequent desalting the enzyme was diluted with the Tris. The preparation was then purified on a Superdex 200PG 16/600 column (GE healthcare) equilibrated with 25 mM Tris HCl pH 7.5, 100 mM NaCl. Concentrated and buffer exchanged into 25 mM Tris HCl pH 7.1. Remaining NaCl is estimated at <5 mM. This form of CaXynQ97-nohis proved to be very soluble remaining is solution at concentrations greater than 100 mg/ml. Protein crystal screening was performed with a preparation at ˜40 mg/ml.

Functional Characterization.

Wheat arabinoxylan was digested with the same number of Units of activity of the three enzymes tested. Following a pre-established time the reaction was killed by heating to 90° C. for 10 minutes. The reaction volume was then adjusted to 90% ethanol by addition of 100% ethanol and allowed to precipitate overnight at 4° C. The resulting material was centrifuged at 20000×g for 30 minutes at 4° C. and the supernatant and the pellet were isolated. The pellet was washed with cold 100% ethanol and the supernatant was rotovaped to remove all the ethanol and then brought to a known volume. The pellet was resuspended to this same volume. Total reducing end was determined with the Nelson's test and total carbohydrate was determined with the Phenol Sulfuric assay. The degree of polymerization (DP) was calculated as the quotient of these two values and used to characterize the oligomeric state and hence information regarding the xylanases used.

Comparison of XynQ97 and XynC71.

Representative examples of the disclosed GH30-8 enzymes (XynQ97 and XynC71) differ slightly in their respective hydrolysis product profiles, but it is clear they are very similar when compared to the canonical GH30-8 enzyme family homolog XynC. They are shown to yield a series of small xylooligosaccharides and aldouronates indicating the enzymes have no obvious preference for the GA xylan chain appendage. This is further supported by the arabinoxylan limit hydrolysis by these enzymes, in comparison to XynC, which was unable to degrade this substrate in any way (FIG. 6 and FIG. 10A), the enzymes of the present invention readily processed this substrate to small xylooligosaccharides and small arbinofuranose substituted xylooligosaccharides.

Notably, based on the intensity of the remaining sample spot for the TLCs of FIG. 6 and FIG. 20 (bottom spots), the disclosed enzymes processed the arabinoxylan to smaller oligoxylosides and arabinofuranose xylooligosaccharides more efficiently than the GH10 xylanase XynA1CD indicating that the disclosed enzymes are better at cleaving xylan chains in regions of frequent side chain substitution.

XynC71, while it has a comparably low rate of hydrolysis (see Example 1) also appears to process wheat arabinoxylan just as efficiently as XynQ97, but glucuronoxylans to a slightly lesser degree (FIG. 6). Relative to the limit hydrolysis product profile of a typical GH10 xylanasc and a canonical GH30-8 appendage specific xylanase on either of these two xylans types, XynQ97 and XynC71 act in a similar fashion due to the altered β7-α7 and β8-α8 loops.

Our data suggests that the GH30-8 xylanases of the present invention are able to hydrolyze highly substituted xylan substrates where other xylanases are unable to function due to steric interaction between the enzyme and xylan side chain substitutions, thereby liberating more substituted xylooligomers and yielding more oligosaccharide sugar liberated from highly substituted often insoluble xylan substrates (Table 4).

TABLE 4 Comparison of XynQ97 with Industry Leading GHl0 Xylanase. Total Anal- Supernatant/ Total RE Carbohydrate Enzyme ysis Pellet (umoles) (umoles) DP XynQ97 1 Supernatant 32.87 218.15 6.63 Pellet 0.191 5.14 27.1 2 Supernatant 27.00 191.69 7.1 Pellet 0.298 5.26 17.7 Industry 1 Supernatant 39.48 230.9 5.84 Leading Pellet 0.232 9.92 42.84 GH10 1 Supernatant 39.19 184.1 4.69 Xylanase Pellet 0.750 18.78 21.12 PbXyn10A1CD 2 Supernatant 28.6 175.81 6.14 Pellet 1.53 36.01 23.57

Example 3. Methods of Using the GH30-8 Xyalanases to Reduce Biomass

The enzymes and enzyme compositions of the present invention can be used to hydrolyze biomass materials or other suitable xylan containing feedstocks.

The GH30-8 xylanases and compositions thereof provided above are useful in reducing xylans found in any source, including biomass, due to their unique structure. Specifically, without the canonical GH30-8 subfamily specific loop regions which define the GA appendage specificity of these enzymes, the GH30-8 subset of enzymes presently disclosed are considered free of such restrictions or limitations and therefore appears to function more generally as a β-1,4-endoxylanase, not having clear preference toward xylan side chain appendages. These GH30-8 xylanases of the present invention are therefore more comparable to the very common β-1,4-endoxylanases of glycoside hydrolase families 10 and 11 while simultaneously being unique from these by yielding unique hydrolysis product profiles and obtaining better liberation of oligomers from highly substituted xylans.

These “generic” GH10 and GH11 endo-β-1,4-xylanases process xylans by working around the appendages like obstacles which prevent the enzyme from accessing the xylan chain. In doing this the GH10 endoxylanase is known to produce the smallest of the GA-substituted xylooligosaccharides (aldotetrauronate). This is because the substrate binding cleft of these xylanases can accommodate 2-GA appendages separated by just two xyloses (−3 and +1 subsites).

Using the canonical GH30-8 xylanase as a comparison it is known that these enzymes specifically bind the GA in the −2 subsite (FIGS. 2 & 3). Further, the limit hydrolysis product analysis yields the smallest GA substituted xylooligossacharide as aldotriuronate (see FIG. 6) resulting from a digest of beechwood glucuronoxylan by XynC). This indicates that the GH30-8 xylanases of the present invention appear likely to accommodate 2 GA appendages separated by just one xylose.

This is supported by the limit hydrolysis of both beechwood xylan and wheat arabinoxylan by XynQ97 (FIG. 6). For the enzymes representing the GH30-8 and GH10 xylanase families, the limit products show the expected results from many previously reported biochemical characterizations. For GH30-8 xylanases, hydrolysis of a glucuronoxylan results in singly substituted aldouronates of varying lengths, each containing a GA substitution penultimate to the reducing terminal xylan as described above (FIG. 1). Hydrolysis of arabinoxylan by this enzyme yields no detectable hydrolysis products as this substrate does not contain any GA substitutions. Hydrolysis of these different xylans by XynA1 also yields results that are expected from previous characterizations. On a glucuronoxylan, a GH10 enzyme will hydrolyze the polymer primarily to xylobiose with smaller amounts of xylose and xylotriose along with the primary limit aldouronate product, aldotetrauronate (FIG. 21A).

Example 4. Selection of GH30-8 Xylanases

We sought to select several additional GH30-8 xylanases of the present invention and confirm their xylan hydrolysis product profiles. Based on qualitative assessment of the uniqueness of the amino acid sequence of the β7-α7 and β8-α8 loop regions as observed in FIG. 5, we selected UniProt accession numbers COIQA1, H1YFT8 and E4T705 for further characterization. The DNA sequence of the GH30-8 with accession number C0IQA1 derived from the nematode Meloidogyne incognita (Mi, MiXyn30A) was synthesized for cloning into an expression construct. This was done by the company DNA2.0 (Menlo Park, Ca). The DNA coding sequence was optimized for expression in E. coli containing no secretion signal sequence, with addition of a C-terminal His tag for affinity purification and the synthesized fragment cloned into their pJ411 protein expression vector. For the protein coding sequence with accession number H1YFT8 which derives from bacterium Mucilaginibacter paludis (Mp, MpXyn30A) and accession number E4T705 from the bacterium Paludibacter propionicigenes (Pp, PpXyn30A), genomic DNA for these two bacteria was ordered from DSMZ microbial stock center in Germany. The genes coding for these two enzymes were PCR amplified and the products cloned into the pET28 E. coli expression vector creating a fusion with the vector encoded C-terminal His-tag for affinity purification of the protein expression product.

In addition to these we also obtained the complete enzyme for which Q97 was the representative catalytic domain. This protein is referred to as Q97_RCN as the additional C-terminal portion of the native amino acid sequence encodes a carbohydrate binding domain of family 13 (CBM13, Ricin-like domain, RCN). This expression construct was obtained from DNA2.0 just as the original Q97 expression construct was obtain. However, this new expression construct did not include an affinity purification tag. The Q97_RCN protein was considered important to show that the enzyme still functions the same with respect to hydrolysis product profiles. Although, untested at this point, it is expect that inclusion of the native CBM13 domain of Q97 may improve upon functional features such as hydrolysis reaction rate, better performance in disruption of insoluble xylans, and provide better commercial qualities such as an increased temperature optimum and reaction stability.

For these four new protein expression constructs, expression was quickly optimized for different growth temperatures using laboratory auto-induction procedures. It was shown that MiXyn30A did not result in a soluble protein expression product. This was confirmed by the lack of a recoverable, near-pure protein of the correct molecular weight size by nickel immobilized metal affinity chromatography (IMAC) followed by SDS-PAGE (FIG. 19). Just to verify, the “peak region” for this IMAC elution was desalted and concentrated to an estimated protein concentration of 0.3 mg/ml. This was used in a 50 ul overnight xylan digestion reaction intended for TLC analysis (FIG. 20). For this protein, no hydrolysis of xylan is observed, most likely due to not obtaining a soluble MiXyn30A protein. Most likely this protein is not soluble as it is a eukaryotic enzyme which may, in its native nematode host, be glycosylated, a feature which may enhance solubility and other biophysical features. The MpXyn30A enzyme expression was also not very good, but IMAC chromatography resulted in a relatively pure protein in the expected size range by SDS-PAGE (FIG. 19). Hydrolysis product analysis by TLC showed only a very low level of hydrolysis occurred in the overnight reaction (FIG. 20). Notably, the barely detectable larger oligosaccharides produced by this enzyme hydrolysis suggest that MpXyn30A was not an efficiently functioning xylanase and this is likely due to the specific sequence of the β7-α7 and β8-α8 loop regions. Our original thinking, that changes to these two regions in the GA-dependent (canonical) GH30-8 xylanases could nullify the function of these enzymes was valid. We took a risk with a strong possibility that no activity would be detectable and discovered a novel, broad-specificity xylanase activity. Just opposite of this finding, the PpXyn30A protein, although it also did not express well, resulted in the best yield of pure protein following the IMAC purification. This is nicely seen from the SDS-PAGE analysis (FIG. 19). Further, an overnight reaction with this protein yielded a hydrolysis product profile nearly identical to Q97 and C7I. This again bolsters our claim regarding these GA-independent GH30-8 xylanases (FIG. 20).

The Q97_RCN protein expresses the best of all the other three, catalytic domain only proteins. Although expression is good the vast majority of the expressed protein at all tested expression temperatures is in inclusion bodies. Still, purification of the soluble form yielded plenty for biochemical studies (FIG. 19). TCL analysis of a xylan hydrolysis confirmed that the modular enzyme functioned just as the isolated catalytic domain of Q97 (FIG. 20).

Other embodiments and uses of the invention will be apparent to those skilled in the art from consideration from the specification and practice of the invention disclosed herein. All references cited herein for any reason, including all journal citations and U.S./foreign patents and patent applications, are specifically and entirely incorporated herein by reference. It is understood that the invention is not confined to the specific reagents, formulations, reaction conditions, etc., herein illustrated and described, but embraces such modified forms thereof as come within the scope of the following claims.

SEQUENCE LISTING

This specification includes the sequence listing that is concurrently filed in computer readable form. This sequence listing is incorporated by reference herein. 

We claim:
 1. A method of hydrolyzing or digesting a lignocellulosic biomass material comprising; contacting the lignocellulosic biomass material with a glycoside hydrolase family 30 subfamily 8 (GH30-8) enzyme composition comprising an isolated polypeptide, wherein the polypeptide is a glucuronic acid (GA)-independent GH30-8 enzyme comprising the amino acid sequence (W or Y)(W or F)W(I or V or F)(not R)(not R) (SEQ ID NOs:69-80) within the β8-α8 loop of the enzyme, wherein the lignocellulosic biomass material comprises hemicellulose, cellulose, or mixtures thereof.
 2. The method of claim 1, wherein the lignocellulosic biomass material comprises an agricultural crop, a byproduct of a food/feed production, a lignocellulosic waste product, a plant residue, or waste paper.
 3. The method of claim 1, wherein the lignocellulosic biomass material is subjected to a pretreatment.
 4. The method of claim 3, wherein the pretreatment comprises a thermal, aqueous or thermomechanical pulping.
 5. The method of claim 3, wherein the pretreatment comprises an acidic pretreatment or a basic pretreatment.
 6. The method of claim 1, wherein the GA-independent GH30-8 enzyme comprises an amino acid sequence at least 85% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1 residues 33-420 (Q97TI2), SEQ ID NO:2 residues 32-421 (F1TBY8), SEQ ID NO:3 residues 45-423 (E4T705), SEQ ID NO:4 residues 33-399 (H1YFT8), SEQ ID NO:32 (COIQA1), SEQ ID NO:33 (B3TJG3), SEQ ID NO:34 (G7M3Z8), SEQ ID NO:35 (M1N0D3), SEQ ID NO:36 (F7ZYN8), and SEQ ID NO:37 (F0KEL6).
 7. The method of claim 6, wherein the GA-independent GH30-8 enzyme comprises an amino acid sequence that is at least 90% identical to the selected amino acid sequence.
 8. The method of claim 7, wherein the GA-independent GH30-8 enzyme comprises an amino acid sequence that at least 95% identical to the selected amino acid sequence.
 9. The method of claim 1, wherein the GH30-8 enzyme composition further comprising at least one additional protein having enzymatic activity. 